Picture for Bin Xie

Bin Xie

Towards Robust Process Reward Modeling via Noise-aware Learning

Add code
Jan 19, 2026
Viaarxiv icon

R$^2$PO: Decoupling Training Trajectories from Inference Responses for LLM Reasoning

Add code
Jan 17, 2026
Viaarxiv icon

MaskMed: Decoupled Mask and Class Prediction for Medical Image Segmentation

Add code
Nov 19, 2025
Viaarxiv icon

SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation

Add code
Nov 12, 2025
Viaarxiv icon

MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation

Add code
Aug 26, 2025
Figure 1 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 2 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 3 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 4 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Viaarxiv icon

GeoVLA: Empowering 3D Representations in Vision-Language-Action Models

Add code
Aug 12, 2025
Figure 1 for GeoVLA: Empowering 3D Representations in Vision-Language-Action Models
Figure 2 for GeoVLA: Empowering 3D Representations in Vision-Language-Action Models
Figure 3 for GeoVLA: Empowering 3D Representations in Vision-Language-Action Models
Figure 4 for GeoVLA: Empowering 3D Representations in Vision-Language-Action Models
Viaarxiv icon

From Outcomes to Processes: Guiding PRM Learning from ORM for Inference-Time Alignment

Add code
Jun 14, 2025
Figure 1 for From Outcomes to Processes: Guiding PRM Learning from ORM for Inference-Time Alignment
Figure 2 for From Outcomes to Processes: Guiding PRM Learning from ORM for Inference-Time Alignment
Figure 3 for From Outcomes to Processes: Guiding PRM Learning from ORM for Inference-Time Alignment
Figure 4 for From Outcomes to Processes: Guiding PRM Learning from ORM for Inference-Time Alignment
Viaarxiv icon

GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent

Add code
May 22, 2025
Viaarxiv icon

Rethinking Timesteps Samplers and Prediction Types

Add code
Feb 04, 2025
Figure 1 for Rethinking Timesteps Samplers and Prediction Types
Figure 2 for Rethinking Timesteps Samplers and Prediction Types
Figure 3 for Rethinking Timesteps Samplers and Prediction Types
Figure 4 for Rethinking Timesteps Samplers and Prediction Types
Viaarxiv icon

RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2

Add code
Feb 04, 2025
Figure 1 for RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2
Figure 2 for RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2
Figure 3 for RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2
Figure 4 for RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2
Viaarxiv icon